A Unified Framework for Information Extraction from Newspaper Images
نویسندگان
چکیده
Nowadays Newspapers are very common source of information which is easily available to all. It consists of all sorts of news like social news, political news and lots of advertisements. These advertisements/announcements are concentrated on some specific page. This paper proposes a system that can extract contact information like email address, website address and telephone number from newspaper advertisements regarding job, contract, biding and other announcements of company. Proposed system will be able to store old advertisements details for future references. It is very easy for human being to spot the words in an image but it takes lots of computation for a computer to extract and separate these words. This paper explains the necessary steps which are required to recognize optical characters like segmentation, smoothing, image processing and neural network implementation for image recognition.
منابع مشابه
Unified subspace analysis for face recognition - Computer Vision, 2003. Proceedings. Ninth IEEE International Conference on
We propose a face difference model that decomposes face difference into three components, intrinsic difference, transformation difference, and noise. Using the face difference model and a detailed subspace analysis on the three components we develop a unified framework for subspace analysis. Using this framework we discover the inherent relationship among different subspace methods and their un...
متن کاملUnified Subspace Analysis for Face Recognition
We propose a face difference model that decomposes face difference into three components, intrinsic difference, transformation difference, and noise. Using the face difference model and a detailed subspace analysis on the three components we develop a unified framework for subspace analysis. Using this framework we discover the inherent relationship among different subspace methods and their un...
متن کاملReflection of Knowledge and Information Science’s News in the Press: A Case Study of Iran Newspaper
Background and Aim: The present study aims to explore the coverage and reflection of Knowledge and Information Science news in the Iranian press. Iran Newspaper which is one of the main public newspapers in the country has been selected as the case for this study. Method: This study used content analysis as its research methodology and adopted an inductive approach in data analysis. All the pag...
متن کاملObject-Oriented Method for Automatic Extraction of Road from High Resolution Satellite Images
As the information carried in a high spatial resolution image is not represented by single pixels but by meaningful image objects, which include the association of multiple pixels and their mutual relations, the object based method has become one of the most commonly used strategies for the processing of high resolution imagery. This processing comprises two fundamental and critical steps towar...
متن کاملNewspaper Headlines Extraction from Microfilm Images
Automatic indexing is important for a digital library to provide digitized manuscripts of old document images and their electronic text. As an essential step in creating such a system, this paper discusses the issue of extracting headlines from old newspaper microfilms. Most research on document layout analysis has largely assumed relatively clean images. However microfilm images of old newspap...
متن کامل